Implementation of Web Query Languages Reconsidered: Beyond Tree and Single-Language Algebras
Visions of the next-generation Web such as the “Semantic Web” or the “Web 2.0” have triggered the emergence of a multitude of data formats. These formats have different characteristics as far as the shape of data is concerned (for example, tree- vs. graph-shaped). They are accompanied by a puzzlingly large number of query languages, each limited to one data format. Thus, a key feature of the Web, namely to make it possible to access anything published by anyone, is compromised. This thesis is devoted to versatile query languages capable of accessing data in a variety of Web formats. The issue is addressed from three angles: language design, common, yet uniform semantics, and common, yet uniform evaluation. First, we consider the query language Xcerpt as an example of the advocated class of versatile Web query languages. Using this concrete exemplar allows us to clarify and discuss the vision of versatility in detail. Second, a number of query languages, XPath, XQuery, SPARQL, and Xcerpt, are translated into a common intermediary language, CIQLog. This language has a purely logical semantics, which makes it easily amenable to optimizations. As a side effect, this provides what is, to the best of our knowledge, the first logical semantics for XQuery and SPARQL. It is a very useful tool for understanding the commonalities and differences of the considered languages. Third, the intermediate logical language is translated into a query algebra, CIQCAG. The core feature of CIQCAG is that it scales from tree- to graph-shaped data and queries without efficiency losses when tree data and queries are considered: it is shown that, in these cases, optimal complexities are achieved. CIQCAG is also shown to evaluate each of the aforementioned query languages with a complexity at least as good as the best known evaluation methods so far.
For example, navigational XPath is evaluated with space complexity O(q ⋅ d) and time complexity O(q ⋅ n), where q is the query size, n the data size, and d the depth of the (tree-shaped) data. CIQCAG is further shown to provide linear time and space evaluation of tree-shaped queries for a larger class of graph-shaped data than any method previously proposed. This larger class of graph-shaped data, called continuous-image graphs, short cigs, is introduced for the first time in this thesis. A (directed) graph is a cig if its nodes can be totally ordered in such a manner that, for this order, the children of any node form a continuous
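The abstract breaks off in the middle of the cig definition. As a rough illustration only, the sketch below assumes the intended condition is that the children of every node occupy consecutive positions in the chosen total order. The function name `children_form_intervals`, the adjacency-dictionary representation, and the sample graph are all hypothetical and not taken from the thesis. The sketch only checks one given order; it does not search for an order that works.

```python
from typing import Dict, Hashable, List, Sequence


def children_form_intervals(
    children: Dict[Hashable, List[Hashable]],
    order: Sequence[Hashable],
) -> bool:
    """Check the (assumed) cig condition for one given total order.

    Returns True if, under `order`, the children of every node sit at
    consecutive positions. This reading of the truncated definition is
    an assumption, not a statement from the thesis.
    """
    position = {node: i for i, node in enumerate(order)}
    for kids in children.values():
        if not kids:
            continue
        ranks = sorted({position[k] for k in kids})
        # Consecutive positions: the span equals the count minus one.
        if ranks[-1] - ranks[0] != len(ranks) - 1:
            return False
    return True


# Hypothetical directed graph: a -> b, c and b -> c, d (node c is shared).
graph = {"a": ["b", "c"], "b": ["c", "d"], "c": [], "d": []}

print(children_form_intervals(graph, ["a", "b", "c", "d"]))  # True
print(children_form_intervals(graph, ["a", "b", "d", "c"]))  # False
```

In the second call the children of `a` land at positions 1 and 3 with `d` in between, so that particular order fails the check, even though the first order shows that a suitable one exists for this graph.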
Similar resources
Querying the Web Reconsidered: Design Principles for Versatile Web Query Languages
A decade of experience with research proposals as well as standardized query languages for the conventional Web and the recent emergence of query languages for the Semantic Web call for a reconsideration of design principles for Web and Semantic Web query languages. This article first argues that a new generation of versatile Web query languages is needed for solving the challenges posed by the...
Querying the Web Reconsidered: A Practical Introduction to Xcerpt
This article gives a practical introduction to the language Xcerpt, guided by many examples illustrating language constructs and usage. Xcerpt is a rule-based, declarative query and transformation language for XML data. In Xcerpt, queries and the (re-)structuring of answers (also called "constructions") are expressed in terms of patterns instead of path navigations (like in XSLT and XQuery...
Choosing the Most Suitable Query Language for Using Outer Joins to Extract Data in Datalog Mode in the Deductive Database System DES
Deductive database systems are designed based on a logical data model. In a deductive database system, data are stored as facts (as opposed to a relational database management system (RDBMS), in which data are stored in tables). Datalog Educational System (DES) is a deductive database system in which Datalog mode is the default mode. It can extract data using outer joins with three query la...
Rule-Based Composite Event Queries: The Language XChange and its Semantics
Reactive Web systems, Web services, and Web-based publish/subscribe systems communicate events as XML messages, and in many cases require composite event detection: it is not sufficient to react to single event messages, but events have to be considered in relation to other events that are received over time. Emphasizing language design and formal semantics, we describe the rule-based query lan...
A New Way to Query GISs on the Web
IEEE Software, May/June 1998. Query and analysis lie at the heart of a geographic information system.1 The ability to analyze geographical patterns and relationships differentiates GISs from computer cartography, computer-aided design, remote sensing, and database management. Computer scientists have researched relational database query languages for more than a decade, but...